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Abstract. We present an automatic building type (usage) labeling based on 
the footpri nt data. The usage i nformati on of bui I di ngs i s of great i nterest for 
many applications, eg., navigation, city planning and emergency manage- 
ment. This attribute, however, is generally not provided in the volunteered 
data sources I i ke OpenStreetM ap and i s often i ncompl ete even i n the off i ci al 
cadastral maps. I n this paper, we propose a method to enhance the maps 
with the building usage information exclusively using the geometric and 
topological features in the footprint data. A general category is predefined 
with four classes: residential, commercial, industrial and public. A novel 
inference framework is proposed using two new high-level (composite) ge- 
ometric characteristics for the local description of the individual buildings 
and the Markov Random Field model to incorporate the contextual con- 
straints of the neighborhood. Experiments are performed on both Open- 
StreetM ap and cadastral data showing the potential of the proposed meth- 
od. 
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1. Introduction 

The usage (use and occupancy) information of buildings is of great interest 
for many applications, eg., navigation, city planning, emergency manage- 
ment, etc. This attribute, however, is generally not provided to the buildings 
in a consistent way in the volunteered data sources like OpenStreetM ap 
(OSM). Although a map feature catalogue exists, the volunteers are not 
obliged which attributes they set for the usage information. Even in the offi- 
cial cadastral maps, the building usage information is not always available. 
An approach to enhance OSM data is presented by Werder et al. (2010), in 
which an unsupervised classification of spatial data solely based on the ge- 
ometric and topological characteristics is proposed. Both building outlines 
and road network information are employed. Luscher et al. (2009) present 



a classification of buildings also based on topographic vector data by means 
of an ontology-driven approach. Supervised Bayesian inference is used to 
deal with the vagueness in definitions of spatial phenomena. 

In this work, we classify the usage type of individual buildings in the urban 
area according also exclusively to their geometric and topological features 
derived from the given footpri nt data. A general category is predefi ned with 
four types of usage: (1) residential, eg., single and multiple family houses, 
apartment buildings; (2) commercial, eg., office buildings, supermarkets, 
shopping malls; (3) industrial, eg., factory buildings, warehouses, and (4) 
public, eg., museums, memorials, hospitals, theaters, stadia, universi- 
ties/schools. We propose two new high-level geometric features: "effective 
width" and "branching degree", which are designed to quantify the living 
space and the structural complexity of the buildings, respectively. A novel 
inference framework is presented using (composite) geometric characteris- 
tics for the local description of the individual buildings and the Markov 
Random Field (MRF) model to incorporate the contextual constraints of the 
neighborhood. MRF (Kindermann & Snell, 1980), also known as Markov 
network, is an undirected graph model, in which the random variables hold 
Markov properties (cf. Section 3). It is widely used in image processing and 
computer vision (Li, 2009) for the labeling/ segmentation of the image pix- 
els or sub-regions and other applications like point cloud grouping, eco- 
nomics and sociology. In this work we use the vertices of the MRF to repre- 
sent the individual buildings and the edges between vertices to encode the 
neighborhood constraints. By these means the buildings of dense urban 
areas can be classified and labeled with the above defined usage attributes 
more reasonably considering the geometric features and the neighborhood 
constraints. 

The paper is organized as follows. I n Section 2 we introduce the two new 
high-level composite geometric features, i.e., the effective width and the 
branching degree, and the definition of the local (unary) energy based on 
them. Section 3 presents the modeling of the building network and their 
neighborhood relationships via MRF, the definition of the contextual (bina- 
ry) energy, and the optimization of the overall energy function. Experiment 
results and evaluation are demonstrated in Section 4. The paper ends up 
with conclusions in Section 5. 



2. Building Attributes 

First, we study the contribution of the local geometric features to the identi- 
fication of the building types. One basic attri bute that can be easily derived 
from the footprint data is the building area. It can somehow reflect the 



building usage, e.g., one building smaller than 200 square meters may be a 
si ngl e-fami ly house and that of over 20000 square meters wi 1 1 very I i kely be 
a factory or warehouse. The problems of using such simple measure, how- 
ever, is also clear: e.g., complex buildings such as apartment buildings may 
also have large footprint area and therefore cannot be distinguished from 
i ndustri al or public building without consi deri ng the shape characteri sti cs. 

Considering the shape factor, a simple one can be defined as the ratio of 
building length and width. This L/W ratio helps to differentiate bar-like 
shape (often for residential or industrial buildings) and square-l ike shape 
(often for public or commercial buildings). But it works only well with rec- 
tangular buildings. For complex buildings, although the bounding box can 
be used to calculate L/W ratio, the values cannot reflect their real shape any 
more. 

High-level attributes are therefore required to integrate multiple geometric 
attributes and provide more precise description to the building shape. In 
this work two new composite measures: "effective width" and "branching 
degree" are proposed specifically for the purpose of building usage classifi- 
cation. 

2.1. Effective width 

The effective width (EW) is an estimated width of buildings with arbitrary 
shapes. 1 1 is defi ned as the average width of the footpri nt along the center- 
line. For this we have to determine the centerline of the building skeleton, 
which describe the approximate length of the building. 

Haunert & Sester (2008) compare different types of skeletons that are 
commonly used in geographic information systems for deriving polygon 
centerlines. As the basis of our work, we select a simple skeleton, the 
"straight skeleton", which only comprises straight lines in contrast to the 
"medial axis". The latter comprises also second-order lines, that would 
cause computational overhead. The straight skeleton is presented by Aich- 
holzer etal. (1995) and isexemplarilyshown in Figurel(a). 

Please note, as shown in Figure 1 (building 1), the length of the centerline 
(L c =150 m) can differ from the actual building length (L B =7.50 m).To de- 
rive a reliable Lq to approximate the L B , we modify the original straight 
skeleton by extending the derived centerlines to the building boundary (red 
line in Figure! b). The effective width is practically calculated as the ratio 
of the bui I di ng area (A B ) to the bui I di ng I ength : 



(a) (b) 

Figure L Building skeletons: (a) Straight skeleton of buildings (centerlines bold) 
and (b) modified centerlines (red lines). 

The effective width is of interest in the usage classification because it actual- 
ly implies the general living/ movement space inside the building. By this 
means the residential buildings can be well distinguished from the industri- 
al or public ones. In many building category definitions the single-family 
houses and multi-family houses (e.g., apartment buildings) must be defined 
as two separate classes because their area and complexity are remarkably 
different. Using EW, as demonstrated in Figure 2, the values of these two 
types of residential buildings show consistency, although the building areas 
and the complexities (calculated by "branching degree", cf. Section 2.2) are 
not close to each other. 
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Figure 2. Effective width shows value consistency for the apartment building (top) 
and the si ngle-f ami ly house ( bottom) by i ndicati ng the I i vi ng space of the bui Idi ngs. 

2.2. Branching degree 

Another novel high-level attribute proposed in this work is the branching 
degree(BD). It scores the number and distribution of the bui I ding segments 



(called "branches") derived from the skeleton centerline to measure the 
structural complexity of the building. Please note that in this case the con- 
ventional straight skeleton as shown in Figure 1 (a) is employed for better 
overall structural analysis. 

First, we define the longest linear segment of the skeleton centerline as the 
"trunk" and then the other segments as the "branches". The number, size, 
and branching angle of the branches are integrated as: 

m 

BD = V w t ■ T j — , 

4—* L trunk 
1 = 

where m is the total number of branches and L indicate the length of the i- 
th branch or the trunk (also the 0-th segment, L = L trunk ). The weight of 
each branch or trunk is w t = 2 ■ ajn with a t the intersection angle(in radi- 
ans, a t e (0,7r/2] ) of the current branch to its parent (as shown in Figure 
3). The higher order branches have the same weight as that of the first or- 
der. A set of BD examples are given in Figure 3. Generally, we can imagine 
that the residential (except the apartment complex, cf. Section 2.1) and in- 
dustrial buildings may have lower BD while the public buildings have nor- 
mally higher complexity. One advantage of giving the weight w t is the sim- 
ple building with slight curve shape (Figure 3, building 5) can be better 
scored. The curved centerline in the straight skeleton is represented by a 
chain of linear segment and the weights can help to prevent the BD value 
being too high with the large number of "branches". 
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Figure 3. Definition of trunk/ branches and the intersection angle (top) and exam- 
ple values of the branching degree (bottom). 



2.3. The local energy 

Figure 4 shows a rough sketch to summarize the distribution of building 
clusters with different usage types in the space of EW-BD. Each building 
can be represented as a point in this 2D parameter space and the probabil- 
ity this building belongs to one of the class is inversely proportional to its 
(standardized) distance to the centroid of the class, which is empirically 
given with generic values. 
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Figure 4. Distribution of buildings with different usage types in the parameter 
space of EW-BD. 

The local potential of a building is then defined as a quaternary value of the 
probabilities: 

Plocal = {PR.PcPl.Pp} 

that this building should belabeled. The probabilities are standardized with 
a sum of 1 E.g., if one building with a probability distribution of {p R = 
0.2, p c = 0.1, p, = 0.6, p P = 0.1} is given a label R, the current local energy of 
this vertex is 0.2, if label / then 0.6. 



3. Context model 

In this work, we use the MRF to model the buildings in the dense urban 
area and their neighborhood relationships. We define the graph model G as: 

G = {V, E} , 

where each individual buildings are represented as vertices, v = v u i e V, 
and the edges, e = e(i,j),{i,j} e E, connecting pairs of vertices (cliques). 
Any pair of non-neighbor vertices is conditionally independent given all 
other vertices. 



3.1. The definition of neighborhood 

As shown in Figure 5, the neighborhood of buildings is defined based on the 
determination of Voronoi cells of the buildings centroids (distance-based 
approach). That is, as illustrated in Figure 5 (left), the polygons divide the 
whole area into seamless eel Is. For each centroid there will be a correspond- 
ing region consisting of all points closer to that centroid than to any other 
(Euclidean distance). And thus, all cells which share an edge are called 
neighbors. With the determination of neighbors, MRF holds the Markov 
properties: (1) pairwise Markov property: any two non- neighbor vertices 
are independent; (2) local M arkov property: a vertex is conditionally inde- 
pendent to all other vertices given its neighbors, and (3) global Markov 
property: any two non-adjacent subsets are conditionally independent giv- 
en a separati ng subset. 



Figure 5. Definition of neighborhood of buildings: (left) polygons of buildings and 
their centroids marked with red points and their Voronoi cells; (right) the MRF 
model with the edges connecting neighbor buildings. 

3.2. The overall energy 

The overall energy function of the MRF consists of two components: the 
unary and the bi nary terms: 



The unary energy u(x c t ) summarizes the local features of the individual 
buildings. It is calculated withthelocal potential described in Section 2.3: 

u(Xi,C x ) = Pi cali. x d 

with x t e {R, C,I,P] being the random labeling assignment. It indicates the 
I i kel i hood of the I abel i ng. 





The neighborhood inferences are encoded into the binary term, which im- 
plies the neighborhood plausibility of usage in the pairwise cliques. The 
plausibility is evaluated in two aspects: 

1 Type consi stency: nei ghbor bui I di ngs are i ncl i ned to have the same 
type; 

2. Logical neighborhood: it reflects reasonable city planning for adja- 
cent areas, eg., residential buildings are more likely be found near 
publ i c bui I di ng i nstead of i ndustri al zone. 

The rewards as well as penalties of neighborhood proposals are embedded 
i n a symmetr i c matr i x N : 
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The binary energy of each clique is directly calculated based on this matrix 
as: 

b(x u xj) = N(i,j) . 

The goal is to find the maximums of the graph model. We use X to repre- 
sent the configuration, i.e., a set of label assignments to all the vertices. The 
optimization task can be expressed as: 

«.„ r , M .„ r .j 2 ,,,.„„ 2 H,,,,j 

V 1 w J 

with x u Xj e X. 

3.3. Stochastic sampling 

Dealing with the data for dense urban area, the established MRF model is 
highly connected. In the optimization process the labels of all the vertices 
can be altered and lead to different configurations. These make the optimi- 
zation task being a extremely high-dimensional problem and computational 
intractable for direct solution. In this work we employ statistical approxi- 
mation by means of a Gibbs sampler (Geman & Geman, 1984) for this task. 
A Gibbs sampler is one Markov Chain Monte Carlo (MCMC) algorithm. It 



performs the random sampling specifically from the potentially complicat- 
ed multivariate probability distribution with a large set of variables. 

Let s be the step number and M the corresponding state of the model, the 
sampl i ng process can be summari zed as fol I ows: 

1 Initialization: M s=0 , X 5=0 (giveX( based on unary likelihood only) 

2. Propose a new state M' with the correspondi ng configuration X'. 

2.1Samplenew label for thefirst building 



with n the total number of buildings and x x c {x r , ...,x n } the 
neighbors of building 1 As mentioned before, the current vertex is 
conditionally independent to the non-neighbor vertices. The condi- 
tional probability of the current vertex is defined following Bayesi- 
an inference: 



where the discrete likelihood p{x\x) can be directly derived from 
theMatrixN given different labels to x and update the p iocai O). The 
resulting quaternary distribution is in practice directly normalized 
without calculating the margin probability of the neighbor buildings 

p(x). 

2.2 Sample new labelsfor thefurther buildings 



withx^ the previously labeled neighbors and% the other neighbors. 
2.3 Calculate the overall energy K' (cf. Section 3.2) according to X'. 
3. Accept the new proposal with Metropolis-Hastings criterion 



with p(M \D ) the likelihood that the current model fi tting the data D, whose 
rati o can be represented by that of the overal I energy K . 

4 ^(5+1) - M < if acce pted, otherwise^ (s+1) = M s . 

The search stops when there is no more K improvement in the last 1000 
iterations with the assumption that the overal I energy converges. 




p(x\x) = 



Plocaljx) ■ p(x\x) 

p(x) 





4. Experiments 



Experiments are performed on data-sets of urban areas from the OSM and 
the official cadastral maps. Figure 6 shows the OSM data of one part of Bos- 
ton, USA, with 94 buildings. The manually labeled ground truth is given in 
Figurel(b). Figure6(c) presents a temporary labeling result based only on 
local geometric features: 68 out of 94 buildings (72.3%) are correctly identi- 
fied. Figure 1(d) shows the final labeling result considering both the local 
features and the contextual constraints. The classification accuracy is im- 
proved to 97.8%. 
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Figure 6. Example data of Boston, USA: (a) OSM data, (b) ground truth labeled 
manually, (c) labeling based on local features (unary energy) only, and (d) final 
labeling result considering both unary and binary terms. The incorrectly labeled 
buildings are highlighted with bold red contours in the results(c) and (d). 




Another example for the cadastral map can be found in Figure 7. There are 
456 buildings in total in this section of Hanover, Germany. Figure 7 (left) 
presents the final classification result with the accuracy of 89.7%. The ma- 
jority of the buildings are correctly labeled with the proposed method. The 
errors often happen in the classification of the commercial buildings, which 
is in many cases tricky as they have less distinct geometric characteristics 
(moderate values of EW and BD, cf. also Figure 4) than the other types. 
Commercial buildings have, therefore, more possibility to be mislabeled to 
the other buildings and vice versa. 

PI ease note that the classification is solely based on geometric and topologic 
criteria of the building footprint, i.e. shape and geographic context of the 
objects. The use of neighborhood information includes additional 
knowledge, which improves the classification based only on the characteris- 
tics of individual buildings- as shown in Figure 6. 

As a matter of fact there is an inherent uncertainty in the classification of 
building usage, which sometimes makes it difficult or even impossible also 
for a human to decide on the correct classification - even when additional 
knowledge is taken into account. E.g., are residential buildings with shops 
in the ground floor residential or commercial buildings? Is a train station 
with shops sti 1 1 a publ i c or a commerci al bui I di ng? 




Figure 7. Example data (cadastral map) of Hanover, Germany: the labeling result 
(left) and the ground truth (right). The incorrectly labeled buildings are highlighted 
with bold red contours in the result. 



5. Conclusion 



This paper presents an automatic labeling of building type (use and occu- 
pancy) solely based on the building footprint data. A category is predefined 
with four classes: residential, commercial, industrial and public. We pro- 
pose two new high-level geometric features: effective width and branching 
degree, which are designed to quantify the average living space and the 
structural complexity of the buildings, respectively. MRF is employed to 
model the network of buildings, in which the local geometric features are 
given to the vertices that represent the individual buildings while the con- 
textual constrai nts are embedded i n the edges that model the nei ghborhood 
relationship. The optimized labeling configuration is statistically searched 
by means of the Gi bbs sampler. The OSM or cadastral maps can thereby be 
enhanced with the predicted building usage information, which is derived 
from the existing geometric and topologic features. 

I n this work we have proposed a general and rather rough category with 
onlyfour types as our main goal isto explore the potential of using only the 
building footprint data. Please note that there is actually a wide variety of 
definitions for the building usage. More concrete classification or definition 
for specific purposes, e.g., the ten-classes building occupancy classification 
(International code council, 2006) primarily for the fi re code enforcement, 
could be used. However, then more non-geometric building attributes are 
required, because of the class definition like "educational" (schools up to 
the 12th grade) or "high-hazard" (places that product and store flammable 
or toxic materials). If additional knowledge is available, it can be included, 
e.g. one could start off with a more elaborate (supervised) classification, 
and i ncl ude more contextual knowl edge, such as knowl edge about the vi ci n- 
ity to other features, which might give an indication concerning the usage. 
An example would be the inclusion of the knowledge of a market square, 
which would increase the likelihood that the surrounding buildings are 
commercial. 

For the future work we first consider a new definition of neighborhood, 
which is essential totheMRF model. In this work we simply use the Voro- 
noi cells to defined the neighbors based only on the centroids of the build- 
ings (cf. Section 3.1). More sophisticated methods I ike the "constrained De- 
launaytri angulation" considering the object points and lines (Sester, 2005) 
can be employed to determine more reasonable neighborhood and thereby 
improve the labeling result. More experiments can be performed on larger 
urban areas/ wholecities with appropriate pre-partitioning. 

Furthermore, the introduced framework for building classification is easily 
extendable and adaptable: new attri butes/ measures can be added into both 



the unary and binary terms to improve the labeling performance or the user 
can select certain attribute^) corresponding to their specific task and defi- 
nition of building types. 
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